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Abstract 

Let C be a finite set of n elements and 1Z = {ri, ri, . . . , r m } a family of 
m subsets of C. A subset X of 1Z verifies the Consecutive Ones Property 
(C1P) if there exists a permutation P of C such that each ri in X is an 
interval of P. A Minimal Conflicting Set (MCS) S C 7Z is a subset of 1Z 
that does not verify the C1P, but such that any of its proper subsets does. 
In this paper, we present a new simpler and faster algorithm to decide if a 
given element r £ 7Z belongs to at least one MCS. Our algorithm runs in 
0(n 2 m 2 +nm 7 ), largely improving the current 0(m 6 n 5 (m+n) 2 log(m+n)) 
fastest algorithm of [Blin et al, CSR 2011]. The new algorithm is based on 
an alternative approach considering minimal forbidden induced subgraphs 
of interval graphs instead of Tucker matrices. 

1 Introduction 

Let C = {ci, . . . , c„} be a finite set of n elements and 1Z = {n, r2, ■ ■ ■ , r m } 
a family of m subsets of C. Those sets can be seen as a m x n 0-1 matrix 
M = (7Z,C), such that the set C represents the columns of the matrix, and the 
set 1Z the rows of the matrix: each n € 1Z represents the set of columns where 
row i has an entry 1. 

A subset X of 1Z verifies the consecutive ones property (C1P) if there exists 
a permutation P of C such that each r» in X is an interval of P. Testing the con- 
secutive ones property is the core of many algorithms that have applications in 
a wide range of domains, from VLSI circuit conception through planar embed- 
dings [5] to computational biology for the reconstruction of ancestral genomes 
[Tl [21 SI El [HI • We focus on this last field in this paper. 

On real biological matrices, the C1P is rarely verified, and only some subsets 
of rows might verify the desired property.However, the combinatorics of such 
sets is difficult to handle, and a strategy to deal with them has been proposed 
in [2 [51 IHj • It consists in identifying the rows belonging to minimal conflicting 
subsets of rows that do not verify the C1P, but such that any of their row subset 
does. 
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Definition 1 A set S C U, S ^ is a Minimal Conflicting Set (MCS) if S 
does not verify the C1P, but such that VA", X <Z S , the set X verifies the C1P. 



Ci c 2 c 3 c 4 c 5 c 6 c 7 c 8 



However, it is not difficult to build examples 
of matrices such that the number of MCS is 
polynomial or even exponential in the num- 
ber of rows. 

Figure [T] shows such an example in which 
each sub set of 3 rows is a MCS. Thus, such a 
construction with m rows gives C™ = 0(m 3 ) 
MCS. Note that, on this example, a single 
row is included in 0(m 2 ) MCS. 

Figure [2j (a) shows another example 
where the number of MCS is exponential in 
the number of rows. Let k be the number of 
nodes of external rows, which are r-j,r§, and 
rg on the figure. The total number of rows is 
3k, the number of columns 2k, and the num- 
ber of MCS is 2 k since any induced chordless cycle in the row intersection graph 
of the matrix (Figure [2]- (b)) constitutes a MCS. 
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Figure 1: A matrix not verifying 
the C1P and such that each set 
of 3 rows is a MCS. 
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Figure 2: (a) A matrix not verifying the C1P and such that the number of MCS 
is exponential in the number of rows, (b) A row intersection graph of the matrix 
whose vertices correspond to the rows of the matrix, and such that there exists 
an edge between two rows and rj if n rj ^0. 

From a computational point of view, the first question that arises is the 
following: is a given row r E 1Z included in at least one MCS ? This question 
has been raised in [T] , recalled in [H [S] and recently solved in polynomial time 
0(m 6 n 5 (m + n) 2 log(m + n)) in [3]. This currently fastest algorithm is based on 
the identification of minimal Tucker forbidden submatrices [101 [B] . 

In this paper we present a new simpler 0(m 2 n 2 + nm 7 ) time algorithm for 
deciding if a given row belongs to at least one MCS and if true exhibit one. Our 
algorithm is based on an alternative approach considering minimal forbidden 
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induced subgraphs of interval graphs [7] instead of Tucker matrices. Moreover, 
our central paradigm consists in reducing the recognition of complex forbidden 
induced subgraphs to the detection of induced cycles in ad-hoc graphs, while in 
[3] only induced paths are considered. Our approach is faster and simpler, but 
a limit shared by both approaches resides in avoiding to report the number of 
MCS to which a given row belongs. 

2 MCS and Forbidden induced subgraphs 

The row-column intersection graph of a 0-1 matrix M = (7Z,C) is a vertex- 
colored bipartite graph Grc(M) whose set of vertices is 1Z U C ; the vertices 
corresponding to rows (resp. columns) are black (resp. white) ; there exists an 
edge between two rows r; G 1Z and rj e 1Z if rj n r j ^ 0, and there exists an 
edge between a row r £ 1Z and a column c e C if c e r. 

It should be noted that a column vertex (white) is only connected to row 
vertices (black). 

The neighborhood N(r) of a row r is the set of rows intersecting r, 2V(r) = 
{leK : r n £ 7^ 0} and AT(rj, r 3 ) = N(ri) n N(rj). The span L(c) of a column 
c is the set of rows containing x, L(c) = {r € 7vL : c € r}. 




23 kg e34 kg 
k>2 k > 2 



Figure 3: Forbidden induced subgraphs for the row-column intersection graph 
of M = (K,C) to verify C1P. 

Theorem 1 ([7], Theorem 4) 4 0-i matrix M = (TZ,C) verifies the C1P 
if and only if its row- column intersection graph does not contain a forbidden 
induced subgraph of the form I, II, III, IV, or V (Figure^. 

Property 1 From Theorem^ a set S C K is a MCS if the row-column inter- 
section graph Grc{S ,C) contains a subgraph of the form I, II, III, IV, or V; 
and for any T C S, Grc(TiC) does not contain a subgraph of the form I, II, 
III, IV, or V. 
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Given a MCS 5C7J,a forbidden induced subgraph contained in Grc(S,C) 
is said to be responsible for the MCS S. If this forbidden induced subgraph is 
of the form I (resp. II; III; IV; V), we simply say that S is a MCS of the form 
I (resp. II; III; IV; V). 

Definition 2 A row of a MCS S that intersects all other rows of S is called a 
kernel of S. In a forbidden induced subgraph responsible for S, any kernel of S 
constitutes a black vertex that is connected to all other black vertices. 

Property 2 Note that an induced subgraph of the form II, III, IV, or V nec- 
essarily contains at least one kernel, while an induced subgraph of the form I 
contains no kernel. 

We denote by Gr(M), the subgraph of Grc(M) induced by the set of rows 
TZ, thus containing only black vertices. 

Graph sizes. Gr(M) has m vertices and at most min(mn,m 2 ) edges, while 
Grc(M) has m + n vertices and at most min((m 4- n) 2 , m 2 n) edges. 

3 A global algorithm 

Our algorithm to decide if a row r £ TZ of a 0-1 matrix M = (TZ,C) belongs to 
at least one MCS, is based on a sequence of algorithms for finding a forbidden 
subgraph of Grc(M) responsible for a MCS containing r. It looks for forbidden 
subgraph of the form I, III, II, IV, V, in the following order: 1. MCS of type 
I, 2. MCS of size 3 (types IV or V), 3. MCS of type II, 4. MCS of type III, 
5. MCS of type IV and size larger or equal to 4, and MCS of type V and size 
larger or equal to 4. See Figure |4] for an overview. The steps 1 to 4 are based 
on straightforward brute-force algorithms, while the two last steps relies to a 
reduction to the detection of induced chordless cycles in ad-hoc graphs. 
In the following, we simply write Grc(M) as G and Gr(M) as Gr. 

3.1 Step 1: Forbidden induced subgraph I 

We first test if r belongs to a MCS of the form I. If it is true, then r belongs 
to an induced chordless cycle of G of length at least 4 containing only black 
vertices. Such a cycle exists in G if and only if is also a chordless cycle in Gr 
since Gr is the subgraph of G induced by the set of rows TZ. Thus it suffices to 
search for an induced chordless cycle in Gr. 

Proposition 1 Algorithm Check_I is correct and runs in worst case 0(m 5 ) 
time. 

Proof. The correctness of Algorithm CheckJ comes from the fact that, r is 
contained in a MCS of the form I if and only if r belongs to an induced chordless 
cycle of Gr of length at least 4 whose set of vertices S constitutes the MCS 
(Figure|4jl). A P4 of Gr is an induced chordless path of Gr containing 4 vertices. 
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Figure 4: The different steps of the algorithm: in each case, when row r has 
a specific location in the forbidden induced subgraph that is looked for, this 
location is indicated in bold character. Other rows and columns of the forbidden 
induced subgraph are indicated in grey color characters. 



In this case, Algorithm CheckJ returns such a set of vertices since an induced 
chordless cycle of Gr of length at least 4 containing r is a P4 containing r whose 
extremities are linked by a chordless path in the subgraph of G that does not 
contain the neighborhood of the internal vertices of the P4. This set S cannot 
contain a smaller subset of rows that is a MCS, as no subset of S can be a MCS 
of the form I, or a MCS of any other form because of Property [2j 

Algorithm CheckT might be implemented in 0(m 5 ). The test performed 
on a give P4 containing r (lines 2-5 of the algorithm) can be achieved in 
0(min(mn, m ) + mlogm) as follows: removing the neighborhood of its inter- 
nal vertices might be done in min(mn,m 2 ) time, and finding a chordless path 
between the two extremities might be performed using Dijkstra's algorithm in 
0(min(mn, m ) + mlogm) time. Enumerating all P4 containing r might be 
done in time 0(m 3 ) using a BFS from r stopping at depth 4. Eventually, the 
whole algorithm is in 0(m 3 (min(mn, m 2 ) + mlogm)) = 0(m 5 ) time. □ 
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Algorithm 1 ChcckJ (r, G R ) - 0(m 5 ) 



Input: a row r, the subgraph Gr. 

Output: returns a MCS S given by a forbidden induced subgraph of 
the form I containing r if such a MCS exists, otherwise returns "NO". 



1: for any P4 of Gr containing r do 

2: Consider the graph G" obtained from Gr after removing the two internal 
vertices of the P4 and their neighborhood from the graph, and consider 
the extremities and rj of the P4 

3: if there exists a r^j-path in G' then 

4: find a chordlcss path P in this graph linking rj and rj. 

5: return the set of vertices of the P4 plus the set of vertices of P 

6: end if 

7: end for 

8: return "NO" 



Precomputation. In the following steps, we assume that the following pre- 
computations have been achieved: 

• For any triplet of rows (r,ri,rj) that are pairwise intersecting, i.e each 
couple is an edge in G, r — (rj U rj) and (rj n rj) — r are precomputed ; 

• Two rows r, and rj are overlapping if rj n r j 7^ and r, — rj 7^ and 
rj — Ti 7^ 0. The overlapping relation between any couple of rows is 
precomputed ; 

• For any quadruplet of rows (r, r i} rj,rk) such that rj, rj, and r/j overlap r, 
r — (rj n n r k ) is precomputed. 

All those precomputations can simply be performed in C>(m 4 n) time using 
straightforward algorithms, that is, scanning the n columns of the input matrix 
for each triplet or quadruplet of rows. 

3.2 Step 2: Forbidden induced subgraph responsible for a 
MCS of size 3 

We test here if r belongs to a MCS of size 3. A MCS of size 3 is necessarily 
caused by a forbidden induced subgraph of the form IV or V. As a consequence, 
the following property is immediate. 

Property 3 A MCS of size 3 is always composed of 3 rows that are pairwise 
overlapping. 
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Algorithm 2 CheckTV_V_3 (r, G) - 0{m 2 ) 



Input: a row r, the row-column intersection graph G. 

Output: returns a MCS S of size 3 given by a forbidden induced subgraph of 
the form IV or V containing r if such a MCS exists, otherwise returns "NO". 



l: for any couple {r^rj) of black vertices that both overlap r, and overlap 
each other do 

2: if r — (r*j U Tj) 7^ and r,-(rU r^) 7^ and Tj — (r U r,) 7^ then 
3: return {r, rj , } 
4: end if 

5: if {ri fl r,) — r 7^ and (r n r 3 ) — r, 7^ and (r n r, ; ) — r j 7^ then 

6: return {r, r$ , } 

7: end if 

8: end for 

9: return "NO" 



Proposition 2 Algorithm Check-IV-VS is correct and runs in 0(m 2 ) time. 

Proof. The correctness of Algorithm Check JV_V_3 comes from the fact that, r 
is contained in a MCS of size 3 if and only if this MCS is caused by a forbidden 
induced subgraph of the form IV or V (Property [3]). Thus, r should belong to a 
triplet of rows (r, rj, r,) that are pairwise overlapping, and satisfy the conditions 
given in: 

• either, line 2 of the algorithm to produce a forbidden induced subgraph of 
the form IV (left-end graph in Figure [4jlV_V_3) , 

• or, line 5 of the algorithm to produce a forbidden induced subgraph of the 
form V (right-end graph in Figure |4jlV_V_3). 

In both cases, Algorithm CheckTV_V_3 returns the set {r,ri,rj} as a MCS if 
such a set of rows exists. This set cannot contain a smaller subset of rows that 
is a MCS as 3 is the minimum size of any MCS. 

Algorithm Check_IV_V_3 runs in 0(m 2 ) time since, given r, there might be 
0(m 2 ) couples (r^, rj) on which the tests performed (lines 2-8 of the algorithm) 
might be achieved in O(l), thanks to the precomputations that have been done. 
□ 

3.3 Step 3: Forbidden induced subgraph II 

We test here if r belongs to a MCS of the form II, with the assumption that r 
is not contained in any MCS of size 3. Note that such a MCS is of size 4. 
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Algorithm 3 CheckJL4 (r, G)-0(m 3 ) 



Input: a row r, the row-column intersection graph G. 
Assumption: r is not contained in a MCS of size 3. 

Output: returns a MCS S given by a forbidden induced subgraph of 
the form II containing r if such a MCS exists, otherwise returns "NO". 



l: for any triplet (rj, rj,rk) of black vertices such that ri,rj, r k overlap r do 

2: if there are no edges (r i} rj), (ri,r k ), and (rj,r k ) in G then 

3: return {r,ri,rj,r k } 

4: end if 

5: end for 

6: for any triplet (r^r,-, r k ) of black vertices of such that overlaps r, and 
Tj , r k overlap fj do 

7: if there are no edges (r, ry), (r,r k ), or (rj,r k ) in G then 

8: return {r,, r, fj, ?•>} 

9: end if 

10: end for 

11: return "NO" 



Proposition 3 Algorithm Check_II_4 is correct and runs in 0(m 3 ) time. 

Proof. The correctness of Algorithm Check_II_4 comes from the fact that, if r 
belongs to a MCS of the form II, then r should belong to a quadruplet of rows 
(r,ri,rj,r k ) such that one these rows is a kernel, and the three other rows do 
not intersect each other. Thus, the row r is: 

• either, a kernel of the MCS, tested in lines 1-5 of the algorithm (left-end 
graph in Figure|4jll_4), 

• or, not a kernel of the MCS tested in lines 6-10 of the algorithm (right-end 
graph in Figure |4jll_4). 

In both cases, Algorithm CheckJI_4 returns the set {r,ri,rj,r k } as a MCS if 
such a set of rows exists. This set cannot contain a smaller subset of rows that 
is a MCS as this subset would be a subset of 3 rows that cannot satisfy Property 

131 

Algorithm CheckTV_V_4 runs in 0(m 3 ) time since all the tests performed 
on a given triplet (r^, r,-, r k ) in lines 2-4 and 7-9 of algorithm can be achieved in 
O(l), and given r there might be 0(m 3 ) such triplets. □ 
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3.4 Step 4: Forbidden induced subgraph III 

We test here if r belongs to a MCS of the form III, with the assumption that r 
is not contained in a MCS of size 3. Note that such a MCS is of size 4. 

Algorithm 4 ChcckJIL4 (r, G)-0(m 3 ) 



Input: a row r, the row-column intersection graph G. 
Assumption: r is not contained in a MCS of size 3. 

Output: returns a MCS S given by a forbidden induced subgraph of 
the form III containing r if such a MCS exists, otherwise returns "NO". 



l: for any triplet (r^, rj,r k ) of black vertices such that r i} rj,r k overlap r, and 

r - (n n Tj n r k ) do 
2: if there are no edge (rj, r k ) in G, and (riflrj) — r 7^ 0, and (rjCirk) — r ^ 
then 

3: return {r,rj,rj,r fe } 
4: end if 
5: end for 

6: for any triplet (ri,rj,r k ) of black vertices of such that rj overlaps r, and 
rj,r k overlap n, and rj — (rnrjflrfe) 7^ 0, and {rj,rj,rfe} is not a MCS do 
7: if there are no edge (r, rfe) in G, and (rHrj) — 7^ 0, and (rj-Hrfe) — ^ 7^ 
then 

8: return {n,r,rj,r k } 
9: end if 
10: end for 

11: for any triplet {ri,rj,r k ) of black vertices of such that ti overlaps r, and 

rj,r k overlap r i} and r, — (r fl rj fl r k ) ^ do 
12: if there are no edge (rj,r k ) in G, and (r^-nr) — 7^ 0, and (rr\r k ) — r, 7^ 
then 

13: return {r u r ,r, r k } 

14: end if 

15: end for 

16: return "NO" 



Proposition 4 Algorithm Check-III-4 is correct and runs in 0(m 3 ) time. 

Proof. The correctness of Algorithm Chcck_IIL4 comes from the fact that, r 
belongs to a MCS of the form III if and only if r should belong to a quadruplet 
of rows (r, rj, rj, r^) included in an induced subgraph of the form III such that 
two of these rows are kernels of the subgraph, and one of these kernels contains 
a column of the induced subgraph that is not shared with any of the other rows. 
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Let us call this kernel kerneLl, and the other kernel kernel_2. For example in 
the left-end graph in Figure [4jlIL4, kerneLl=r, and kerneL2=7\,. 
Thus, the row r is: 

• either, kerneLl, tested in lines 1-5 of the algorithm (left-end graph in 
Figure [2JIIL4), 

• or, not a kernel, tested in lines 6-10 of the algorithm (middle graph in 
Figure |4]IIL4). 

• or, kerneL2, tested in lines 11-15 of the algorithm (right-end graph in 
Figure |4]IIL4). 

In the first, and third cases, the set {r^, rj,r k } cannot be a MCS because such a 
set cannot satisfy Property[3]ln all cases, Algorithm Check_IIL4 returns the set 
{r, r i; rj, r-fc} as a MCS if such a set of rows exists, and {r i: rj,rk} is not a MCS 
(in the second case). Since we made the assumption that r is not contained in 
a MCS of size 3, there cannot exists a smaller subset of {r^, r,-, r k } containing r 
that is a MCS. 

Algorithm Check_III_4 runs in 0(m 3 ) time using a similar proof as the com- 
plexity proof for Check_IV_V_4: all the tests performed by the algorithm (lines 
2-4, 7-9, and 12-14 of the algoritms) on a given triplet (r i: rj,rk) are achieved 
in O(l) thanks to the precomputations, and given r there might be 0(m 3 ) such 
triplets. □ 

3.5 Step 5: Forbidden induced subgraph IV 

We test here if r belongs to a MCS of the form IV, with the assumption that r 
is contained, neither in a MCS of size 3, nor in a MCS of type I. Depending on 
whether the size of the MCS is 4 or larger than 4, we describe two algorithms. 

3.5.1 MCS of size 4 

We first test if r belongs to a MCS of the form IV of size 4. We look for a triplet 
of rows (rj, rj , r k ) such that the set {r, r<, Tj , r k } is a MCS of the form IV (Figure 
|4jlV_4). In an induced subgraph of the form IV containing 4 rows {r, r,-, rj, r k }, 
two rows are kernels, and in that case, r is either a kernel of the MCS, or not. 
If r is a kernel, then it is either a kernel -called kerneLl- containing a column 
of the induced subgraph that is not shared with any of the other rows , or not 
-called kcrnel_2-. For example, in the left-end graph in Figure 4.IV-4, the two 
kernel are the two central black vertices of the graph: the top one is a kerneLl, 
and the bootom one a kernel_2. Algorithm Check_IV_4 looks for each of these 
configurations: 

• r is a kerneLl, tested in lines 1-5 of the algorithm; 

• r is not a kernel, tested in lines 6-10 of the algorithm; 

• r is a kernel_2, tested in lines 11-15 of the algorithm. 
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The proof of the correctness of Algorithm Check_IV_4 is similar to the proof for 
Algorithm Check JIL4. 



Algorithm 5 Check_IV_4 (r , G) - d(m 3 ) 



Input: a row r, the row-column intersection graph G. 
Assumption: r is not contained in a MCS of size 3. 

Output: returns a MCS S of size 4 given by a forbidden induced subgraph 
of the form IV containing r if such a MCS exists, otherwise returns "NO". 



l: for any triplet (rj, rj,r k ) of black vertices such that r t , rj, r k are connected 

to r, and r — (rj n rj n r k ) 7^ do 
2: if there are no edge (r i7 rfe) in G, and (rj n rj) 7^ 0, and (rj n rfe) 7^ 0, 

and ^ — (r U rj) 7^ 0, and r t -(rU rj) 7^ then 
3: return {r,rj,rj,r fe } 
4: end if 
5: end for 

6: for any triplet (ri,rj,r k ) of black vertices of such that r t is connected to 
r, and rj, r fe are connected to rj, and ^-(rfl rj n r fc ) 7^ 0, and {r i} rj,r k } 
is not a MCS do 

7: if there are no edge (r, r k ) in G, and (r n rj) 7^ 0, and (rj n rfe) 7^ 0, and 

r - (r» U rj) ^ 0, and r k - (n U rj) ^ then 
8: return {n,r,rj,r k } 
9: end if 
10: end for 

11: for any triplet (r,,rj,rfc) of black vertices of such that rj is connected to 

r, and rj, r k are connected to rj, and rj — (r n rj n rfe) 7^ do 
12: if there are no edge (rj 7 r k ) in G, and (rj n r) 7^ 0, and (r n rfe) 7^ 0, and 

rj — (r U rj) 7^ 0, and r k — (r U rj) 7^ then 
13: return {rj,rj,r,r fe } 
14: end if 
15: end for 
16: return "NO" 



Proposition 5 Algorithm Check-IV-4 is correct and runs in 0(m 3 ) time. 

Proof. The proof for Algorithm Check_IV_4 is similar to the proof for Algorithm 
Check_III_4. □ 

3.5.2 MCS of size larger than 4 

We test here if r belongs to a MCS of the form IV of size larger than 4. A MCS 
of the form IV of size larger than 4 contains one and only one kernel. Depending 
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on whether r is the kernel or not, we distinguish two cases here. 
Case 1: If row r is the kernel of the MCS 

Algorithm CheckJVfc recovers a MCS S of the form IV of size larger than 4 
containing r as a kernel, with the assumption that r is not contained in a MCS 
of size 3 (Figure |4jlVfc). The principle of the algorithm relies in first choosing 
the column c € C, of the forbidden induced subgraph of type IV responsible for 
iS, that is contained in r, and in no other row of the MCS (see Figure g}lV fe ). 
Next, it considers the subgraph H of G induced by the set of black vertices 
(rows) that are neighbors of r, but do not contain the column c. We denote 
this subgraph by H = G[N(r) — L(c)]. Then, it looks for a set of rows Q, 
constituting a chordless path in H, such that {r} U Q is a MCS of the form IV. 



Algorithm 6 Check_IV fc (r, G) - 0(nm 2 ) 



Input: a row r, the row-column intersection graph G. 
Assumption: r is not contained in a MCS of size 3. 

Output: returns a MCS S of size larger that 4 given by 
a forbidden induced subgraph of the form IV whose ker- 
nel is r if such a MCS exists, otherwise returns "NO". 



1: for any column c£r do 
2: H = G[N(r) - L(c)] 

3: for any connected component C of H do 

4: pick a a couple (r i; rA of black vertices in C that satisfies 1) r, and rj 

are not connected, and 2) ri,fj overlap r. 
5: find a chordless path P in C linking n and Vj 

6: pick the smallest subpath Q of P linking two vertices r\ and r'p such 

that the couple (r^r'j) also satisfies 1) and 2) 
7: return {r} U Q 
8: end for 
9: end for 
10: return "NO" 



Proposition 6 Algorithm Check^IVk is correct and runs in 0(nm 2 ) time. 

Proof. Note that, if the MCS exists, then all the rows belonging to the MCS, 
except r, belong to a same connected component of H. Thus, in each connected 
component of H, the algorithm looks for a chordless path Q linking two vertices 
ri,rj satisfying 1) and rj are not connected, and 2) r^r^ overlap r, and 3) 
Q does not contain any smaller subpath satisfying conditions 1) and 2). These 
conditions are necessary and sufficient for the set {r} U Q to form the rows of 
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a induced subgraph of the form IV. The set {r} U Q cannot contain a subset 
that is a MCS as such a smaller MCS should be: 

• either a MCS of size 3 including r, which impossible by assumption, 

• or a MCS of type II or III necessarily including r as kernel, 

• or a MCS of type IV and size larger than 3 having r as kernel. 

The two last cases are also impossible, since Q would not have satisfy condition 
3) in these cases. 

Next, there might be n columns c E r and up to m 2 couples (rj, rj) of black 
vertices to test before finding a valid couple (Vi,rj) satisfying the conditions 
in line 4 of the algorithm. Up to this point, the complexity is in 0(nm 2 ). 
Assume now that such a couple exist. Then finding a chordless path between 
n and Tj might be done by searching for a shortest path between r, and Vj in 
the connected component C using Dijkstra's algorithm, which thus requires at 
worst 0(min(mn, m 2 ) + to log m) time. The path is of length at most m, and 
thus identifying r\ and r'j is bounded by testing each pair on this path in C, 
which requires at worst 0{m 2 ) time. Thus, in total, the algorithm is 0(nm 2 ) 
worst case time. □ 

Case 2: If row r is not the kernel of the MCS 

Algorithm Check JV P recovers a MCS S of the form IV of size larger than 4 
containing r, but not as a kernel, with the assumptions that r is not contained 
in a MCS of size 3, and r does not belong to an induced chordless cycle of Gr 
(Figure 4.IV P ). The principle of the algorithm consists in first choosing the 
kernel a of S among the black vertices (rows) neighbors of r, and the column 
c E C, of the induced subgraph of type IV responsible for S, that is contained 
in a, but in no other row of the MCS. (see Figure |ij I V p ). Next, the algorithm 
calls Algorithm CheckJV to look for the MCS S with r, a, c, and G given as 
parameters. 

Algorithm CheckJV is called in Algorithm CheckJV p . It recovers a MCS 
S of the form IV of size larger than 4 containing r, given the row r, the kernel 
a of the MCS 5, and the column c G C, of the induced subgraph of type IV 
responsible for S, that is contained in a, but in no other row of the MCS (Figure 
0IVp). 

Proposition 7 Algorithm Check-IV p is correct, and runs in 0(nm e ) time. 
Proof. 

The correctness and the complexity of CheckJV p follows directly from the 
the correctness and the complexity of Algorithm CheckJV that is called in 
Algorithm Check JV p . 

The correctness of CheckJV comes from the fact that, r does not belong 
to any chordless cycle in the graph C computed at line 2 of the algorithm by 
assumption. Then at line 6 of the algorithm, any chordless cycle in the graph 
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Algorithm 7 CheckJV p (r, G) - 0(nm 6 ) 



Input: a row r, the row-column intersection graph G. 
Assumption: r is not contained in a MCS of size 3. 

r does not belong to an induced chordless cycle of Gr. 
Output: returns a MCS S of size larger that 4 given by a for- 
bidden induced subgraph of the form IV containing r whose ker- 
nel is not r if such a MCS exists, otherwise returns "NO". 



1: for any black vertex a e N(r) do 

2: for any column c € a — r do 

3: return CheckJV(r, a, c, G) 

4: end for 

5: end for 

6: return "NO" 



Algorithm 8 CheckJV (r, a, c, G)- 0{m 5 ) 



Input: two rows r and o, and a column c € a such that r € (N(a) — L(c)) . 
Assumption: r is not contained in a MCS of size 3. 

r does not belong to an induced chordless cycle of Gr. 
Output: returns a MCS S of size larger that 4 given by a for- 
bidden induced subgraph of the form IV containing r and a, 
whose kernel is a if such a MCS exists, otherwise returns "NO". 



1: H = G[N(a) - L(c)] 

2: let C = (V~c,Ec) be the connected component of H to which r belongs. 
3: let V a be the set of vertices V a = {u e Vc '■ u-a^fl}. 
4: let E a be the set of edges E a = {(u, v) G V a 2 : u(lv = 0}. 
5: let D = (Vd,E d ) be the graph such that Vd = Vc and E D = E c U i? a . 
6: Q = Check J (r, D R ) 
7-. it Q ^ "NO" then 
8: return {a} U Q 
9: end if 
10: return "NO" 
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D containing vertex r necessarily contains at least one edge (ri,rj) belonging 
to the set E a . The number of edges belonging to the set E a in such a chordless 
cycle Q cannot be greater than 1 as any couple of such edges in the chordless 
cycle would induce a chord. Indeed, if Q contains more than one edge belonging 
to E a , any two such edges would have to extremities in V a , one from each of 
the two edges, that are not connected in the graph C. These extremities would 
thus be linked by an edge in E a , creating a chord for the cycle Q in the graph 
D. 

Therefore, the set of vertices of the chordless cycle Q induces a chordless 
path in G such that each vertex of Q is connected to vertex a by definition of 
the graph H, and the extremities and rj of Q satisfy 1) and rj are not 
connected in G, and 2) ri,rj overlap r, and 3) Q does not contain any smaller 
subpath satisfying conditions 1) and 2). These conditions are necessary and 
sufficient for the set {a} U Q to form the rows of an induced subgraph of the 
form IV, and this set cannot contain a smaller MCS since such a MCS would 
be: 

• either a MCS of size 3 including a, 

• or a MCS of type II or III necessarily including a as kernel, 

• or a MCS of type IV and size larger than 3 having a as kernel. 

The 3 cases are impossible, since they would induce a chord from the set E a in 
the chordless cycle induced by Q in the graph D. 

Algorithm CheckTV calls Algorithm CheckT. Both algorithms have the 
same time complexity in 0(m 5 ) time. It follows immediately that Algorithm 
CheckJVp runs in Onm 6 time. □ 

3.6 Step 6: Forbidden induced subgraph V 

We test here if r belongs to a MCS of the form V, with the assumption that 
r is contained neither in a MCS of size 3, nor in a MCS of type I. Depending 
on whether the size of the MCS is 4, 5 or larger than 5, we describe three 
algorithms. 

3.6.1 MCS of size 4 or 5 

We first test if r belongs to a MCS of the form V of size 4 or 5. For a MCS of size 
4, we look for a triplet of rows (r^ , rj , r^ ) such that the set (r, , rj , r^ ) is a MCS 
of the form V. In such a case, we look for an induced subgraph responsible for 
the MCS, containing r,ri,rj,r^ as four black vertices pairwise connectedr, and 
we can pick three different couples of r,ri,rj,rk such that each couple shares a 
column (white vertex) that is not shared with the two other of the MCS (see 
Figure |4jV_4). 

Proposition 8 Algorithm Check-V-4 is correct and runs in 0(m 3 ) time. 
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Algorithm 9 Check_V_4 (r , G) - 0(m 3 ) 



Input: a row r, the row-column intersection graph G. 
Assumption: r is not contained in a MCS of size 3. 

Output: returns a MCS S of size 4 given by a forbidden induced subgraph 
of the form V containing r if such a MCS exists, otherwise returns "NO". 



1: for any triplet (rj, rj,rk) of black vertices such that r i} rj, are connected 

to r, and are pairwise connected do 

2: if (ri,rj,rk) is not a MCS, and (r n rj) — (rj U rfe) 7^ 0, and (r^ n rfe) — 

(rUr.) 7^ then 

3: if n rj) - (r U r fe ) ^ then 

4: return {r, r j , r^- , r fc } 

5: end if 

6: if (r nrj) - (rj Ur fe ) ^ then 

7: return {r l ,r,r j ,r k } 

8: end if 

9: end if 

10: end for 

11: return "NO" 
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Proof. Algorithm Check_V_4 looks for an induced subgraph with 4 black vertices 
{r, Ti, rj, rfe}, that are pairwise connected to each other. These 4 black vertices 
should be such that there exist three different couples of vertices among them, 
such that two couples are disjoint and the third one (called couple_kernel) over- 
laps the two first, and the 2 rows of each of these couples share a column that is 
not shared with the two other rows of the set. In this case, if {fj, r 3 -, r k } is not 
a MCS, then the subgraph induced by {r,ri,rj,rk} and the 3 columns (white 
vertices) connected to the 3 couples of rows is of the form V, and is responsible 
for a MCS {r,ri,rj,rk}. Algorithm Check_V_4 looks for two cases, depending 
on whether r belong to couple_kernel (lines 3-5), or not (lines 6-8). 

Next, all the tests performed by Algorithm Check_V_4 (lines 2-9 of the algo- 
ritm) on a given triplet (rj,r 3 ,rfc) are achieved in O(l) thanks to the precom- 
putations, and given r there might be 0(m 3 ) such triplets. Thus, Algorithm 
Check_V_4 runs in 0(m 3 ) time. □ 

Next, for a MCS of size 5, we look for a quadruplet of rows (r i7 rj, r^, r{) 
such that the set {r,ri,rj,rk,ri} is a MCS of the form V (Figure |4jV_5). Algo- 
rithm Check_V_5 looks for an induced subgraph of the form V, consisting of 5 
rows (black vertices) r,ri,rj,rk,ri that are pairwise connected, except for a on 
missing edge, say (r„, r b ) in {r, r 4 , r 3 ,r k ,ri} x {r, r*, ry, r-fc, rj}, and three columns 
(white vertices) satisfying the configuration of Figure |4jV_5. 

Proposition 9 Algorithm Check_V-5 is correct and runs in 0(m 4 ) time. 

Proof. Algorithm Check_V_5 looks for an induced subgraph with 5 black vertices 
{r, r,, rj, rfc, r{\, that are pairwise connected, except for one missing edge (r a , r b ) 
in {r, n, rj,rk,ri} x {r, Ti,rj,rk,ri}. The 4 black vertices that belong to the set 
with r, should correspond to a set of rows that is C1P. Moreover, there should 
exist two particular rows (black vertices) of the set, with three columns (white 
vertices) that satisfy the conditions on line 4 of the algorithm in order to fit the 
configuration depicted in Figure [4jV_5. 

Next, all the tests performed by Algorithm Check_V_5 (lines 2-8 of the al- 
goritm) on a given quatruplet (ri,Vj,rk,ri) are achieved in O(l) thanks to the 
precomputations, and given r there might be 0(m 4 ) such triplets. Thus, Algo- 
rithm Check_V_5 runs in 0(m 4 ) time. □ 

3.6.2 MCS of size larger than 5 

A MCS of the form V of size larger than 5 contains exactly two kernels. De- 
pending on whether r is a kernel or not, we distinguish two cases. 

Case 1: If row r is a kernel of the MCS 

Algorithm CheckJVfc recovers a MCS S of the form V of size larger than 
5 containing r as a kernel, with the assumption that r is not contained in a 
MCS of size 3, or 4 (Figure [4] Vfe). The principle of the algorithm is similar 
to Algorithm Check JV^. It relies in first choosing the second kernel a of the 
MCS, and the column c, of the induced subgraphof type V responsible for S, 
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Algorithm 10 Check_V_5 (r , G) - 0(m 4 ) 



Input: a row r, the row-column intersection graph G. 
Assumption: r is not contained in a MCS of size 3 or 4. 

Output: returns a MCS S of size 5 given by a forbidden induced subgraph 
of the form V containing r if such a MCS exists, otherwise returns "NO". 



l: for any quadruplet (r{,rj,rk,ri) of black vertices such that r,r{,rj,r k ,ri 
are pairwise connected, except for one edge (r a ,rb) in {r,ri,rj,rk 7 ri} x 
{r,ri,rj,r k ,ri} missing do 

2: if {ri,rj,rk,ri} is C1P then 

3: for any pair (a, b) in ({r.r^r^rk^ri} - {r a ,r b }) x ({r,ri,r v r k ,ri} - 
{r a ,r b }) do 

4: if (a n b) - U({r,r i ,r j ,r fe ,r/} - {a, 6}) ^ 0, and (r k f~l a) - 

U({r,ri,rj,r fe ,n} - {r fc ,a}) 7^ 0, and (r; n 6) - u({r, r 4 , r., , r fc , r ; } - 
{r ; ,6}) ^0then 

5: return {r,rj,rj,7-fc,rj} 

6: end if 

7: end for 

8: end if 

9: end for 
10: return "NO" 
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that is contained in both r and a, but in no other row of the MCS (see Figure 
[IJVfc). Next, it considers the subgraph H of G induced by the set of black 
vertices (rows) that are neighbors of r and a, but do not contain c. We denote 
this subgraph by H = G[N(r,a) — L(c)]. Then, it looks for a set of rows Q, 
constituting a chordless path in H, such that {r} U Q is a MCS of the form V. 



Algorithm 11 Check_V fc (r, G) - 0(n 2 m 2 ) 



Input: a row r, the row-column intersection graph G. 

Assumption: r is not contained in a MCS of size 3, or 4. 

Output: returns a MCS S of size larger that 5 given by a 

forbidden induced subgraph of the form V such that r is one 

of its kernel, if such a MCS exists, otherwise returns "NO". 



l: for any black vertex a € N(r) do 

2: for any column c e (r f~l a) do 

3: H = G[N(r, a) — L(c)] 

4: for any connected component C of H do 

5: pick a a couple (n, Tj) of black vertices in C that satisfies 1) n and 
Tj are not connected, and 2) (r^ n r) — a ^ 0, and 3) (rj n a) — r ^ 0. 

6: find a chordless path P in C linking rj and 

7: pick the smallest subpath Q of P linking two vertices r[ and r'j , such 
that the couple (r£,r'-) also satisfies 1) and 2) and 3) 

8: return {r} U Q 

9: end for 

10: end for 

11: end for 

12: return "NO" 



Proposition 10 Algorithm Check-V c is correct and runs in 0(n 2 m 2 ) time. 

Proof. The proofs are similar to the proofs for the correctness and the complexity 
of Algorithm CheckJV c as the two algorithms are based on the same principle. 
However, here the complexity is multiplied by a factor n due to considering all 
black vertices a € N(r). □ 

Case 2: If row r is not a kernel of the MCS 

Algorithm Check_V p recovers a MCS S of the form V of size larger than 5 
containing r, but not as a kernel, with the assumptions that r is not contained 
in a MCS of size 3 or 4, and r does not belong to an induced chordless cycle of 
G R (Figure |4|V). 
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The principle of the algorithm is similar to the principle of Algorithm CheckJV; 
It consists in first choosing the two kernels (a, b) of S among the black vertices 
(rows) neighbors of r, and the column c, of the induced subgraph responsible 
for S, that is contained in both a and b, but in no other row of the MCS. Next, 
the algorithm calls Algorithm Check.V to look for the MCS S with r, (a, b), c, 
and G given as parameters. 

Algorithm 12 Check_V p (r, G) - 0(nm 7 ) 



Input: a row r, the row-column intersection graph G. 
Assumption: r is not contained in a MCS of size 3, 4. 

r does not belong to an induced chordless cycle of Gr. 
Output: returns a MCS S of size larger that 5 given by a for- 
bidden induced subgraph of the form V containing r, but not 
as a kernel, if such a MCS exists, otherwise returns "NO". 



l: for any couple of connected black vertices (a, b) e N(r) 2 do 

2: for any column c G (a n b) — r do 

3: return Check_V (r, (a, b), c, G) 

4: end for 

5: end for 

6: return "NO" 



Algorithm Check A" is called in Algorithm Check _V p . It recovers a MCS S 
of the form V of size larger than 5 containing r, given the row r, the kernels a 
and b of the MCS, and the column c, of the induced subgraph responsible for 
S, that is contained in a and 6, but in no other row of the MCS. 

Proposition 11 Algorithm Check-V p is correct and runs in 0(nm 7 ) time. 

Proof. In order to prove the correctness and the complexity of Algorithm 
Check A^j,, we need to prove the correctness and give the complexity of Al- 
gorithm Check A^ that is called in Check_V p . 

The correctness of Check_V comes from the fact that r does not belong 
to any chordless cycle in the graph C computed at line 2 of the algorithm by 
assumption. Let Q be a chordless cycle in the graph D containing vertex r, 
computed at line 9 of the algorithm. Since r does not belong to an induced 
chordless cycle of the C by assumption, then Q necessarily contains at least one 
edge belonging to the set Eab U E a U E^. 

We first give two trivial but useful properties for the remaining of the proof: 

(i) For any two edges of Q, there always exists two extremities u and v of 
these edges, one in each edge, that are not disjoint in the graph C, i.e 

uC\v = % 
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Algorithm 13 Check.V (r, (a,b), c, G)- 0(m 5 ) 



Input: three rows r, a and b, and a column c G a f] b such that 
re (JV(o,6)-L(c)) . 

Assumption: r is not contained in a MCS of size 3, 4 or 5. 

r is not contained in a MCS of type IV. 

r does not belong to an induced chordless cycle of Gr. 
Output: returns a MCS S of size larger that 5 given by a forbidden 
induced subgraph of the form V containing a, b, and r, and whose 
kernels are a and b, if such a MCS exists, otherwise returns "NO". 



l: H = G[N(a,b) - L(c)] 

2: let C — (Vc,Ec) be the connected component of H to which r belongs. 

3: let Va be the set of vertices V A = {u e V c : (u n a) - b ^ 0}. 

4: let Vb be the set of vertices V B — {v e V c ■ (v n b) - a ^ 0}. 

5: let Eab be the set of edges Eab = {( u , v), u e Va, u € Vg : M H v = 0}. 

6: let V a be the set of vertices V a = {u £ Vc ■ u-a^D}, and E a be the set 
of edges E a = {(u, v) e V a 2 : !in» = 0}. 

7: let Vt be the set of vertices Vj, = {u € Vc : u — b ^ 0}, and i?b be the set 
of edges E b = {(u, v) e V h 2 : un» = 0}. 

8: let D = (V D , E D ) be the graph such that V D = V c and E D = E c U E AB U 
E a UE b 

9: Q = Check J (r, £r) 
10: if Q ^ "NO" then 
11: return {a, b} Li Q 
12: end if 
13: return "NO" 
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(ii) V A CV b , and V B C V a . 



We also prove the following useful property: 

(iii) V a C (Vg U H) and C (Va U 14). Let x G V" a , there exists c such that 
c G ir and c a. Then, either c G" b in which case x G Vb, or c G b, which 
implies that x G Vb- The proof is similar for Vb C (Va U V a ). 

We now prove that the cycle Q necessarily contains at most one edge of the 
set Eab U E a (J E - Indeed, if Q contains two edges of Eab U E a U Eb, let u, w 
be two disjoint extremities of these edges (Property (i)). We can distinguish 
7 cases according to the belonging of u and v to the sets Va, Vb, V a and V b , 
and we show in the following that, in all these cases, a chord is induced in the 
chordless cycle Q in the graph D: contradiction. 

1. If (u,v) G V\ (resp. (u,v) G Vj), then from Property (ii), (u,v) G V b 2 
(resp. (u, v) G V 2 ), and thus (u, v) G Eb (resp. (u,v) G E a ). 

2. If (u,v) G V a 2 (resp. (u, v) G V h 2 ), then (u,t;) G E a (resp. (u, w) G E b ). 

3. If (u, v) G Va x Vb (or the symmetric), then (u,v) G -Eab- 

4. If (u,f) G Va x V a (or the symmetric), then from Property (iii), (u, v) G 
Va x Vg or (u,v) G Va x Vb, and thus (u, u) G Eab or (u, w) G Eb from 
cases 3. and 6. 

5. If (u,v) G Vb x Vf, (or the symmetric), then from Property (iii), (it, u) G 
Vb x Va or (u, u) G Vb x y a , and thus (w,v) G Eab or ( u , v ) G E a from 
cases 3. and 6. 

6. If (u, v) G Va x Vb (resp. (u,f) G Vg x V a ) (or the symmetric), then from 
Property (ii), (u, v) G V b 2 (resp. (u, v) G V^ 2 ), and thus (u, v) G E& (resp. 
(«,«) G E a ). 

7. If (m,w) G V a x Vb (or the symmetric), then from Property (iii), (m, v) G 
V;, x Vf, or (u,v) G Vb x and thus (u, u) G E b or (u,w) G E A b U E a 
from cases 1 and 5. 

In consequence, there exits at most one edge, and then exactly one edge of the 
set Eab U E a U Eb in the cycle Q in the graph D. Next, let (ri,rj) be the 
only edge of Q belonging to Eab U E a U E b . We show that (n, r 3 ) £ E a U E b . 
Indeed, if (r i} rj) G E a (resp. (r^r^) G E b ), then the set {a}UQ (resp. {&}UQ) 
satisfies the conditions to be a MCS of type IV with a (resp. b) as kernel, which 
is impossible by assumption. 

So, we have (ri,rj) G Eab — (E a U Eb). Finally, removing the edge (ri,rj) 
from the cycle yields a chordless path Q in G containing r such that each vertex 
of Q is connected to vertices a and b, and the extremities and of Q satisfy 
1) rj and are not connected, and 2) (r-j n a) — b 7^ 0, and 3) (rj n 6) — a ^ 0. 
and 4) Q does not contain any smaller subpath satisfying conditions 1) and 2) 
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and 3). These conditions are necessary and sufficient for the set {a, b} U Q to 
form the rows of an induced subgraph of the form V, and this set cannot contain 
a smaller MCS since such a MCS would be: 

• either a MCS of size 3 including a or b, 

• or a MCS of type II or III necessarily including a or b as kernel, 

• or a MCS of type IV and size larger than 3 having a or 6 as kernel. 

• or a MCS of type IV and size larger than 3 having a and b as kernels. 

The 3 cases are impossible, since they would induce a chord from the set Eab U 
E a U E\, in the chordless cycle induced by Q in the graph D. 

The correctness of Algorithm Check_V p follows immediately from the cor- 
rectness of Algorithm Chcck_V. 

Algorithm Chcck_V calls Algorithm ChcckJ. Both algorithms have the 
same time complexity in 0(m 5 ) time. It follows immediately that Algorithm 
CheckJVp runs in Onm 7 time. □ 
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